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Approximating Sparse Covering Integer Programs Online 

Anupam Gupta* Viswanath Nagarajan^ 



Abstract 

A covering integer program (CIP) is a mathematical program of the form: 



o 

(N 1 min{c T x | Ax > 1, < x < u, x G Z™}, 

C$ ■ where A G i?> xn ,c, u G R> - In the online setting, the constraints (i.e., the rows of the 

constraint matrix A) arrive over time, and the algorithm can only increase the coordinates of x 
to maintain feasibility. As an intermediate step, we consider solving the covering linear program 
(CLP) online, where the requirement x G Z" is replaced by x G W 1 . 



Our main results are (a) an 0(log fc)-competitivc online algorithm for solving the CLP, and 
(b) an 0(log/c • log £)-competitive randomized online algorithm for solving the CIP. Here k < n 
and I < m respectively denote the maximum number of non-zero entries in any row and column 



c/3 ' of the constraint matrix A. By a result of Feige and Korman, this is the best possible for 



polynomial-time online algorithms, even in the special case of set cover (where A G {0, l} r 
and c,u G {0,1}™). 



The novel ingredient of our approach is to allow the dual variables to increase and decrease 
I/"") . throughout the course of the algorithm. We show that the previous approaches, which either 

only raise dual variables, or lower duals only within a guess-and-double framework, cannot give 
a performance better than O(logn), even when each constraint only has a single variable (i.e., 
k = 1). 



1 Introduction 



Covering Integer Programs (CIPs) have long been studied, giving a very general framework which 
captures a wide variety of natural problems. CIPs are mathematical programs of the following form: 

a 

min Ya=i °i x i ( IP1 ) 
subject to: Ya=i a ij x i ^ 1 V i G Mj i 1 - 1 ) 
0<Xi<Ui Vi€[n], (1.2) 
x 6 7L n . (1.3) 

Above, all the entries a>ij, Ci, and Uj are non-negative. The constraint matrix is denoted A = 
( a ij)i£\n],j£[m]- We define k to be the row sparsity of A, i.e., the maximum number of non-zeroes in 
any constraint j G [m]. For each row j G [m] let Tj C [n] denote its non-zero columns; we say that 
the variables indexed by Tj "appear in" constraint j. Let I denote the column sparsity of A, i.e., 
the maximum number of constraints that any variable i G [n] appears in. Dropping the integrality 
constraint ( |1.3| ) gives us a covering linear program (CLP). 
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In this paper we study the online version of these problems, where the constraints j € [m] arrive 
over time, and we are required to maintain a monotone (i.e., non-decreasing) feasible solution x at 
each point in time. Our main results are (a) an 0(log /c)-competitive algorithm for solving CLPs 
online, and (b) an 0(\ogk • log £)-competitive randomized online algorithm for CIPs. In settings 
where i € n or f C m our results give a significant improvement over the previous best bounds of 
O(logn) for CLPs []|], and 0(logn • logm) for CIPs that can be inferred from rounding these LP 
solutions. Analyzing performance guarantees for covering/packing integer programs in terms of row 
(k) and column (£) sparsity has received much attention in the offline setting, e.g. [|l5|, [L7], [LlJ 14, g|. 



This paper obtains tight bounds in terms of these parameters for online covering integer programs. 

Our Techniques. Our algorithms use online primal-dual framework of Buchbinder and Naor Jj]] . To 
solve the covering LP, we give an algorithm that monotonically raises the primal. However, we both 
raise and lower the dual variables over the course of the algorithm; this is unlike typical applications 
of the online primal-dual approach, where both primal and dual variables are only increased (except 
possibly within a "guess and double" framework — see the discussion in the related work section). 
This approach of lowering duals is crucial for our bound of O(logfe), since we show a primal-dual 
gap of Jl(logn) for algorithms that lower duals only within the guess-and-double framework, even 
when k = 1. 

The algorithm for covering IP solves the LP relaxation and then rounds it. It is well-known that the 
natural LP relaxation is too weak: so we extend our online CLP algorithm to also handle Knapsack 
Cover (KC) inequalities from ||. This step has an 0(log fc)-competitive ratio. Then, to obtain an 
integer solution, we adapt the method of randomized rounding with alterations to the online setting. 
Direct randomized rounding as in Jl| results in a worse O(logm) overhead, so to get the 0(log£) 
loss we use this different approach. 

Related Work. The powerful online primal-dual framework has been used to give algorithms for 
set cover Q, graph connectivity and cut problems [||, caching [|^, |||, ||, packing/covering IPs ||, 
and many more problems. This framework usually consists of two steps: obtaining a fractional 
solution (to an LP relaxation) online, and rounding the fractional solution online to an integral 
solution. (See the monograph of Buchbinder and Naor J?| for a lucid survey.) 

In most applications of this framework, the fractional online algorithm raises both primal and dual 
variables monotonically, and the competitive ratio is given by the primal to dual ratio. For CLPs, 
Buchbinder and Naor ||] showed that if we increase dual variables monotonically, the primal-dual 
gap can be 0(log amax ). In order to obtain an 0(log n)-competitive ratio, they used a guess-and- 
double framework |8|, Theorem 4.1] that changes duals in a partly non- monotone manner as follows: 

The algorithm proceeds in phases, where each phase r corresponds to the primal value 
being roughly 2 r . Within a phase the primal and dual are raised monotonically. But the 
algorithm resets duals to zero at the beginning of each phase — this is the only form of 
dual reduction. 

For the special case of fractional set cover (where A G {0, l} mxn ), they get an improved 0(log k)- 
competitive ratio using this guess-and-double framework || Section 5.1]. However, we show in 



Appendix A that such dual update processes do not extend to obtain an o(logn) ratio for general 
CLPs. So our algorithm reduces the dual variables more continuously throughout the algorithm, 
giving an 0(log /c)-competitive ratio for general CLPs. 



Other online algorithms: Koufogiannakis and Young [13 gave a /c-competitive deterministic online 
algorithm for CIPs based on a greedy approach; their result holds for a more general class of 
constraints and for submodular objectives. Our 0(log/clog£) approximation is incomparable to 



2 



this result. Feige and Korman |12[ show that no randomized polynomial-time online algorithm can 
achieve a competitive ratio better than 0(log k log I). 

Offline algorithms. CLPs can be solved optimally offline in polynomial time. For CIPs in the 
absence of variable upper bounds, randomized rounding gives an 0(log m)-approximation ratio. 
Srinivasan [15] gave an improved algorithm using the FKG inequality (where the approximation ratio 
depends on the optimal LP value). Srinivasan [16| also used the method of alterations in context of 
CIPs and gave an RNC algorithm achieving the bounds of [ 15] . An 0(log ^-approximation algorithm 
for CIPs (no upper bounds) was obtained in [17] using the Lovasz Local Lemma. Using KC- 



inequalities and the algorithm from |i7], Kolliopoulos and Young |Tl| gave an O (log ^-approximation 
algorithm for CIPs with variable upper bounds. Our algorithm matches this 0(}og£) loss in the 
online setting. Finally, the knapsack-cover (KC) inequalities were introduced by Carr et al. || to 
reduce the integrality gap for CIPs. These were used in jO], |l(]], and also in an online context by || 
for the generalized caching problem. 



2 An Algorithm for a Special Class for Covering LPs 

In this section, we consider CLPs without upper bounds on the variables: 

min Yh=i C i X i 
subject to: Ya=1 a ij x i — 1 Vj € [m], 
x > 

and give an 0(log fc)-competitive deterministic online algorithm for solving such LPs, where k is an 
(upper bound) on the row-sparsity of A = (a™). The dual is the packing linear program: 

Em 

subject to: J2JLi a ijVj ^ c i ^ € M> 

y>o 

We assume that Cj's are strictly positive for all i, else we can drop all constraints containing variable 
i. 

Algorithm I. In the online algorithm, we want a solution pair (x,y), where we monotonically 
increase the value of x, but the dual variables can move up or down as needed. We want a feasible 
primal, and an approximately feasible dual. The primal update step is the following: 

When constraint h (i.e., Yli a ihXi > 1) arrives, 

(a) define d ih = ^ for all i 6 [n], and d m ^ = min; d ih = min ie T h d ih . 

(b) while Ylii a ih x i < 1) update the x's by 

x ne W <_ A + ^PO \ x old + l_UK L ^ v . gIV 

V d ih J k ■ a ih d ih 

Let th be the number of times this update step is performed for constraint h. 

As stated, the algorithm assumes we know k, but this is not required. We can start with the estimate 
k = 2 and increase it any time we see a constraint with more variables than our current estimate. 
Since this estimate for k only increases over time, the analysis below will go through unchanged. 
(We can assume that A; is a power of 2 — which makes log k an integer; we will need that k > 2.) 
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Lemma 2.1 For any constraint h, the number of primal updates th < 2 log A;. 

Proof. Fix some h, and consider the value i* for which di*h = d m ^y In each round the variable 
xi* <— 2xi* + l/(k ■ ai*h)', hence after t rounds its value will be at least (2* — l)/(fc • ai*h)- So if we 
do 2 log A; updates, this variable alone will satisfy the h th constraint. ■ 

Lemma 2.2 The total increase in the value of the primal is at most 2thd m i h y 



the increase in each variable i G Th is -x° ld + t-^- — So the increase in the primal objective 



Proof. Consider a single update step that modifies primal variables from x old to x new . In this step 

d, 

is: 

'dm(h) /d 1 d m{ h) 

Xj ~~\~ 

i£T h 



dih k ■ aih 



d m (h) Yl ^ ■ X t d + d m{h) ' — ^ ' dm{h) 



The inequality uses |T^| < k and S«eT h a «fr ' x t — 1 which is the reason an update was performed. 
The lemma now follows since th is the number of update steps. ■ 

To show approximate optimality, we want to change the dual variables so that the dual increase 
is (approximately) the primal increase, and so that the dual remains (approximately) feasible. To 
achieve the first goal, we raise the newly arriving dual variable, and to achieve the second we also 
decrease the "first few" dual variables in each dual constraint where the new dual variable appears. 

For the h th primal constraint, let d^, d m ^h),th be given by the primal update process. 

(a) Set y h <- d m{h ) ■ t h . 

(b) For each i G Th, do the following for dual constraint ^ ■ a^yj < q: 

(i) If Y,j<h a ijyj - (!01ogA;)Q, do nothing; else 

(ii) Let k{ < h be the largest index such that Ylj<ki a ijyj — (5 log A;) q; let Pi = 
{j < ki | % G Tj } be the indices of these first few dual variables that are active 
in the i th dual constraint. For all j G Pi, 

new , f-i _ dm(h) \ M 

V dih )' Vj ■ 

Observe that the dual update process starts each dual variable yj off at some value d m (j\tj and 
subsequently only decreases this dual variable, and that the dual variables remain non-negative. 

Lemma 2.3 When primal constraint h arrives, the left-hand- side of each dual constraint i increases 
due to the variable yh by • d m t h \ ■ th < (2 log A;) Cj. 



Proof. We set the initial value of the dual variable yh to d m (h\ ■ th- By Lemma 2.1, th < 2 log k. By 
definition, d m /h) < c%jaih- Hence, for any i G Th, the increase in the left-hand-side of dual constraint 
i is at most aih • (2 log A;) (cj/a^) = (2 log A:) c%. This proves the lemma. ■ 



Lemma 2.4 When primal constraint h arrives, if the dual update reaches step b(ii) for some i G Th, 
then ki is well-defined and the set Pi is non-empty; moreover, — ^ logfe^' ^ P> hi- 
proof. For each j < hwe have yj < 2 log k ■ d m r^ , since dual variable yj was initialized to tjd m ^ < 



2 log k ■ d m (j) (by [Lemma 27T|) and subsequently never increased — so aij ■ yj < 2 log k ■ d m u\ ■ < 
2 log k-Ci, using d TO y) < dij = Ci/a{j. If the dual update reaches step b(ii) then we have 2^,j<h a ijVi > 
(10 log k) Ci, but each j < h contributes at most 2 log A: • q, so ki is well-defined, and Pi is non- 
empty. Moreover, by the choice of ki, we have Ylj<ki+i a vyj > (5 log A;) c^, so ^2j <k . &ijyj > 
(5 log k) Ci - a^ki+i ■ Vki+i > (3 log k) ■ Ci, as claimed. ■ 
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Lemma 2.5 After each dual update step, each dual constraint i satisfies Ylj a ijyj — (12 log k) Cj. 
Hence the dual is (12 log k) -feasible. 

Proof. Consider the dual update process when the primal constraint h arrives, and look at any 
dual constraint i € T/, (the other dual constraints are unaffected). If case b(i) happens, then 
by [Lemma 2.3| the left-hand-side of the constraint will be at most (12 log k) a. Else, case b(ii) 
happens. Each yj for j € Pj decreases by yj ■ d m ^/dih, and so the decrease in ^2j €P . a-ijVj is at 



least ^2j £ p. ct-ijUj ■ (d m (h) / dih) ■ Using Lemma 2.4 , this is at least 



%^ • Cj (3 log k) = dm(h) ■ a (3 log k) = d m{h) ■ a ih ■ (3 log k). 

dih Ci/a ih 

But since the increase due to yh is at most • d m t h \ th < dih ' d m i h \ ■ (2 log A;), there is no net 
increase in the LHS, so it remains at most (12 log k)ci. ■ 

Lemma 2.6 The net increase in the dual value due to handling primal constraint h is at least 
2 d m {h) ' th ■ 

Proof. The increase in the dual value due to y^ itself is d m (M • t^. What about the decrease in the 
other yj's? These decreases could happen due to any of the k dual constraints % G Th, so let us focus 
on one such dual constraint i, which reads Ylj-ieTj dijUj < <H- Now for j < h, define 7^ := j^j—- 
Since yj was initially set to tj d m ^ < tj dij and subsequently never increased, we know that at this 
point in time, 

In < %^ < I- (2-4) 

a t j 

The following claim, whose proof appears after this lemma, helps us bound the total dual decrease. 

Claim 1 If we are in case b(ii) of the dual update, then ^2j e p. -^0- < ^ ■ 
Using this claim, we bound the loss in dual value caused by dual constraint i: 

Edm(h) ^m(fc) \ , . d m (h\ v -» 

j6P . ^ j€P . ci/a ih jePi 

-A *i / 1 1 , 11 d m{h) 

— a m{h) a ih ■ > Jij ■ S( Claim 1 ) a m{h) a ih ' TTT ' — Q , ■ 



Summing over the |Tj| < fc dual constraints affected, the total decrease is at most \d m ^) < \d m (h-\th 
(since there is no decrease when th = 0). Subtracting from the increase of d m n^ ■ th gives a net 
increase of at least \d m ^th, proving the lemma. ■ 



Proof of Claim 1: Consider the primal constraints j such that i 6 Tj'. when they arrived, the 
value of primal variable Xi may have increased. (In fact, if some primal constraint j does not cause 
the primal variables to increase, yj is set to and never plays a role in the subsequent algorithm, 
so we will assume that for each primal constraint j there is some increase and hence tj > 0.) 

The first few among the constraints j such that i 6 Tj lie in the set Pj: when j 6 Pj arrived, we 

added at least to a?j's valueQ, and did so tj times. Hence the value of xi after seeing the 

constraints in Pj is at least £ iePi > £. £Pj jgg, using Q. 



1 More precisely, Xi increased by at least k . x ^*. (,) where kj < k was the estimate of the row-sparsity at the 
arrival of constraint j, and k is the current row-sparsity estimate. 
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If x-i is the value of Xi after seeing the constraints in Pj, and Xi is its value after seeing the rest of 
the constraints in Qi : = ({j < h \ i E Tj} \ Pi). Then 



V * II 1 + ^ ^(0 IK 1 + ^ ^<U e 1 ^^ > 2fc 2 . (2.5) 
The last inequality uses the fact that k > 2, and that: 

jeQi jGQi jeQi 1 % \j<h jePi j 



where the inequality is because we are in case b(ii) and X^jeP a *i%' — (51og/c) • Cj by Lemma 2.4 



Finally, when doing the primal/dual update steps for constraint h, the value of X{ just before this 
must have been x[ < (otherwise constraint h would have already been satisfied just by variable 

xi). And Xi is a t least YljePi FcT 1 ' the first calculations. And x'i/Xi — 2/c 2 by ( p.5[) . Putting 
these together gives 

7ij*j < J 1_ 

and hence the claim. ■ 



Lemma 2.6 and Lemma 2.2| imply that the dual increase is at least 1/4 the primal increase, and 
Lemma 2.5 implies we have an 0(log /c)-feasible dual, implying the following theorem: 

Theorem 2.7 Algorithm I is an O (log k)- competitive online algorithm for covering linear programs 
without upper-bound constraints, where k is the row- spar sity of the constraint matrix. 



3 The Online Algorithm for CIPs 

We now want to solve CLPs with variable upper bounds, en route to solving general CIPs of the form 



(1P1). However, it is well-known that when we have variable upper-bounds, the natural relaxation 
has a large integrality gap even with a single constraint] 2 ] Hence, Carr et al. @ suggested adding the 
knapsack cover (KC) inequalities — defined below — to reduce the integrality gap significantly. In this 
section, we first show how to extend Algorithm I to get an 0(log fc)-competitive algorithm for the 
natural CLP relaxation (with upper bounds) where we also satisfy some suitable KC inequalities. 
Next, we round (in an online fashion) such a fractional solution to get a randomized 0(log£ ■ log k)- 
competitive online algorithm for general fc-row-sparse and ^-column-sparse CIPs. 

Knapsack Cover Inequalities. Given a CIP of the form ( |[P1] ), the KC-inequalities for a particular 
covering constraint ^ieM a ij x i — 1 are defined as follows: for any subset H C [n] of variables, the 
maximum possible contribution of the variables in H to the constraint is aj(H) := dijUi, and 

if dj{H) < 1 then at least a contribution of 1 — dj(H) must come from variables [n] \ H. Moreover, 
in any integral solution x, since each positive variable Xj is at least one, we get the inequality: 

T, ie [n]\H m[n { a ij^ 1 - a j( H )} ■ x i > l-aj(H) (3.6) 
Since (|3.6| ) is not be true for an arbitrary fractional solution satisfying 



Sie[n] a ij x i — 1) we ac ^ this additional constraint to the LP, for each original constraint j and 



2 The trivial CIP min{a;i | Mxi > 1} has integrality gap M, no upper bounds needed. However, if we truncate the 
aijS to be at most 1 (which is the right-hand-side value), and we have no upper bound constraints, this gap disappears. 
Introducing upper bounds brings back large integrality gaps, as the example min{xi \xi + (1 — e)i2 > 1, X2 < 1} shows, 
which has an integrality gap of 1/e. 
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H C [n] where dj(H) < 1. There are exponentially many such KC-inequalities, and it is not known 
how to separate exactly over these in poly-time^. But as in previous works || 11 , [| , the randomized 
rounding algorithm just needs us to enforce one specific KC-inequality for each constraint j — namely 
for the set Hj := {i G [n] \ Xi > r -u{\ with some suitable threshold r > 0. We call this the "special" 
KC-inequality for constraint j. 

3.1 Fractional Solution with Upper Bounds and KC-inequalities 

In extending Algorithm I from the previous section to also handle "box constraints" (those of 
the form < xi < Uj), and the associated KC-inequalities, the high-level idea is to create a 
"wrapper" procedure around Algorithm I which ensures these new inequalities: when a constraint 
X^eT- a ij x i — 1 arrives, we start to apply the primal update step from Algorithm I. Now if some 
variable x p gets "close" to its upper bound u p , we could then consider setting x p = u p , and feeding 
the new inequality X/jgTj\p a ij x % 

> 1 —cipjUp (or rather, a knapsack cover version of it) to Algorithm I, 
and continuing. Implementing this idea needs a little more work. For the rest of the discussion, 
r G (0, 5) is a threshold fixed later. 

Suppose we want a solution to: 



(IP) min < 



^^CjXj I a ij x i > 1 Vj G [m], < Xi < Ui, x,i G Z Vi G [n], 
where constraint j has \Sj\ < k non-zero entries. The natural LP relaxation is: 

(P) min ^ CiXi \ a-ijXi > 1 Vj G [m], < Xi < Ui Vi G [n] 



Algorithm ^1 finds online a feasible fractional solution to this LP relaxation (P), along with some 
additional KC-inequalities. This algorithm maintains a vector x G R n that need not be feasible for 
the covering constraints in (P). However x implicitly defines the "real solution" x G M. n as follows: 

Xi if Xi < TUi y,, r 

tij otherwise 

Let x^) and xy' denote the vectors immediately after the j th constraint to (IP) has been satisfied. 



Theorem 3.1 Algorithm 3.1 , given the constraints of the CIP (IP) online, produces x (and hence 
5c) satisfying the following: 

(i) The solution x is feasible for (P). 

(ii) The cost J2l=i c i ' x i = 0(\ogk) ■ opt /P . 

(Hi) For each j G [m] let Hj = {i G [n] | xp > r • u{\ and aj(Hj) = J2reHj a rjUr- Then the 
solution x^') satisfies the KC-inequality corresponding to constraint j with the set Hj, i.e., if 
aj(Hj) < 1 then: 

EieSA^i min 1 ~ a i(- H i)} ' x * 0) - 1 ~ a J'( i ^)- 
Furthermore, the vectors x and x are non- decreasing over time. 



3 KC-inequalities can be separated in pseudo-polynomial time via a dynamic program for the knapsack problem. 
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Again, the value of row-sparsity k is not required in advance — the algorithm just uses the current 
estimate as before. 

The solution x to (P) is constructed by solving the (related) covering LP without upper-bounds — the 



constraints here are defined by [Algorithm 3,1 
(P') 



mm < 



OiihXi > 1 VVi G [m'j, Xi > Mi G [n] 



At the be ginning of the algorithm, h — 0. When the constraint for (IP), namely X^ies- ^ij-^i 

> 1 



arrives online, the algorithm generates (potentially several) constraints for (P') based on it. Claim 2 
shows these are all valid for (IP), so the optimal solution to (P') is at most opt^p. 

Algorithm 3.1 Online covering with box constraints 
When constraint j (i.e., YlieSj a v ' x i — -0 arrives for (P), 
1: set h h + 1, th 0, Fj <— {i G Sj : > tu,,}, Sj \ Fj. 

2: set 6 «- 1 - Yji&Fj a ij u i> and a ih <- min {l, , Mi G T^, and 0% = 0, Vi ^ T fe . 
3: if 6 > then generate constraint Sier h ^i/i 3 -* — 1 f° r (-f ') e l se halt. 

// lfb<0 then constraint j to (P) satisfied 
4: while (S ieTfc a% • a* < 1) do 

// start primal-update process for h th constraint (X)ieT a ih ' Xi>l) to (P'). 
if T h = 0, return infeasible. 



define dih ■= for all i S [nl, and d. 



m(h) 



min,- d 



i Uih 



min ieTh d ih 



define S < 1 to the maximum value in (0, 1] so that: 



max 



1 

Ui 



1 + 5 



d 



m ( h ) \ old 
. x i 



d 



m(h) 



k ■ ctih d ih 



< 



9: perform an update step for constraint h as: 



1 + 5 



d 



d 



ill 



Mh) \ old 

dsn 



+ 



5 d 



m(h) 



10 
11 
12 
13 
14 
15 
16 

17: 



k ■ a ih d ih 

j jXi = Ui - 



Mi G T h . 



i € Fi 



set th <— th + 5. 

let F h ^{i£T h :xi = tv*} and Fj <- Fj [j F' h . 
if [F' h ^ 0) then 

// constraint h to (P') is deemed to be satisfied and new constraint h + 1 is generated. 
set h <— h + 1, th <— 0, and Th <— Sj \ Fj. 

set b <- 1 - X^ig Fj °ii' u i5 a ^ = min {l, , Mi G T h and = 0, Mi g" T h . 
if b > generate constraint X^ieT h GW^i ^ 1 f° r (P')'i e ^ se halt. 
//Ifb<0 then constraint j to (P) satisfied 
end if 



18: end while // constraint j to (P) is now satisfied. 



Clearly x G [0, u]; it is feasible for (P) because (a) we increase variables until the condition in line 4 
is satisfied, and (b) if h denotes the current constraint to (P') at any point in the while-loop, the 
following invariant holds: 

Solution X satisfies constraint h to (P')> *-e. Y2i a ih ' Xi > 1, 
==> X satisfies constraint j to (P), i.e. aij ■ Xi > 1. 



S 



By construction x and x are non-decreasing over the run of the algorithm. Finally, for property (hi), 
note that the condition of the while loop captures this very KC inequality since Th = {i G Sj : X{ < 
t ■ Ui} at all times. 

To show property (ii), we use a primal-dual analysis as in Section [| we will show how to maintain 
an 0(log &;)-feasible dual y for (P 1 ), so that c • x is at most 0(1) times the dual objective X^ fegfm'l Dh- 
This means c • x < 0(log k)opt P , < 0(log k) ■ opt^p, with the last inequality following from |Claim 2 
below. 

Claim 2 The optimal value for the LP (P') is at most optjp, the optimum integer solution to (IP). 

Proof. We claim that every inequality in (P') can be obtained as a KC-inequality generated for 
(IP). Indeed, consider the h th constraint YlieT a ih x i > 1 added to (P'), say due to the j th 
constraint YlieSj a-ij ■ Xi > 1 of (IP). Here Th = Sj \ Fj for some Fj C Sj, and = minjl, ^j 2 -} 
for i £ Tj, with 6 = 1 — X^reF, a n ' u r > 0- ^ n other words, the h th constraint to (P r ) reads 

which is the KC-inequality from the j th constraint of (IP) with fixed set Fj. Now since all KC- 
inequalities are valid for any integral solution to (IP), the original claim follows. ■ 

Now to show how to maintain the approximate dual solution for (P 1 ), and bound the cost of the 
primal update in terms of this dual cost. The dual of (P') is: 




(D 1 ) max < 



I ^2 a ih -y h < Ci Vz G [n], y h >0 Vj G [m] 



The dual update process is similar to that in Section % When constraint h to (P') is deemed 



satisfied in line 13, update dual y as follows: 



Let dih, d m fh) >th be as defined in Algorithm 3.1 

(a) Set y h <- d m ^ ■ t h . 

(b) For each dual constraint i s.t. i G Th (i.e., YlneTi a UVl — c «)> d° the following: 

(i) If Y.i<h a nyi - ( 10 log /c)q, do nothing; else 

(ii) Let hi < h be the largest index such that ^z<fc a uUi ^ (5 log A;) c^; let Pj = 
{I < ki | i G 7]} be the indices of these first few dual variables active in dual 
constraint i. For all / G Pj, set 

<~ (l - min{l, t h } ■ ■ yf d . 

The only difference from Section § is to change ^1 — -gr~ ^ to 

1 — min{l, th} dr ^^ ^ ; this is because maintaining < TUj required us to be cautious and introduce 
the damping factor of 5 G (0, 1] in the primal update, hence th could be much smaller than one. 



Here too, each yh starts off at d m ^th, and only decreases thereafter. Similar to Lemmas 2.1 and 2.2 
we get: 

Lemma 3.2 For any constraint h to (P') ; the value th < 2 log A;. 
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Proof. (Sketch) Each time th increases by 1, the process behaves as before, so if we perform a 
primal increase step then th is an integer strictly less than 2 log k (itself an integer since we assumed 
A; is a power of 2). Also, the first time that th increases by 5 < 1, the algorithm adds at least one 
variable to F h , fixes th and moves on to a new constraint h + 1. ■ 

In the rest of the proof, we omit details that are repeated from Section ^, and only point out 
differences, if any. 

Lemma 3.3 The total increase in Ylie[n] Ci ' Xi ^ ue t° updates for constraint h is at most 2 th (i m (/i) ■ 

Lemma 3.4 In the dual update for constraint h to (P'), variable yh increases the left-hand- side of 
each dual constraint i by ctih ■ d m ^h) ' th < (2 log A;) • Cj. 

Lemma 3.5 If the dual update for constraint h to (P') reaches step b(ii), then ki is well-defined 
and the set Pi is non-empty; moreover, —^j^pr- £ [3 ... 5]. 

Lemma 3.6 After each dual update step, the dual is (12 log k) -feasible; i.e. each dual constraint 
Y,i a uVi < (12 log k) Ci. 

Proof. As in the proof of [Lemma 2.5 , consider the update due to constraint h to (P') and the 



;th 



dual constraint for some i € Th- If we are in case b(i), Lemma 3.4 implies that ^2[ttuyi < 
(10 log k)c{ + (2 log k)ci. For case b(ii), the decrease in the left-hand-side YlieP- auyi of constraint 
i is at least min{l,t/ l } • ^2 ieP . auyi ■ (d m ( h ) / d ih ) ■ By |Lemma 3.5| the sum ^2 leP . auyi > c-i (31og fc) 
and hence the reduction in the left-hand-side of dual constraint i is at least 



min{31og k, t h } 



d 



m(h) 



d m (h) ■ ®ih ■ min{3 log fe, t h } > d m(h) • a ih ■ t h . 



The inequality uses Lemma 3~^ . Combined with Lemma 3.4 it follows that there is no net increase 
in the left-hand-side. Hence we can maintain the invariant that it is at most (12 log A;) Cj. ■ 

Lemma 3.7 The net increase in dual value due to handling constraint h to (P r ) is at least ^ d m ^) ■ 
th- 



Proof. The increase in the dual value due to yh is d m nA ■ th- As in Lemma 2.6 , let us bound the 
decrease in the other y{s. Consider any of the k dual constraints i 6 Tj,. Again define ^fu := 
for I < h; since y\ started off at t\ ■ d m ^ and never increased, we have ju < d m ^/du < 1. Again, as 
in 



Claim 1 



Claim 3 If we are in case b(ii) of the dual update, then ^zep — ' 



Using calculations as in Lemma 2.6, the decrease in dual objective due to dual constraint i is: 



minjl, t h j ■ 2^ — } Vi 



< 



2), d m (h) ■ 



min{l, t h } 



< 



1 

2k- 



d m (h) • th- 



Since there are |T^| < k dual constraints we have to consider, the total decrease is at most \d m ^h) th- 
Subtracting this from the total increase of d m nA ■ th gives the lemma. ■ 

Comparing Lemma 3.7 with Lemma 3.3j , while handling the h th constraint in (P') the increase in 
the dual objective function is at least 1/4 of the increase in the primal objective function c-x. And 
[Lemma 3.6j tells us that y is an (9(log /c)-feasible dual to (P'). Hence: 

C-X < 4(1 • y) < we ak duality 0(log k) ■ Optp, < \jlaj m | 0(log k) ■ Opt /p . 



This completes the proof of property (iii) in Theorem 3.1 



10 



3.2 Online Rounding 



We now complete the algorithm for CIPs by showing how to round the online fractional solution 
generated by [Theorem 3.1 also in an online fashion. This rounding algorithm also does randomized 



rounding on the incremental change like in H, but to get a loss of 0(log£) instead O(logm), we use 



the method of randomized rounding with alterations p, 16]. Recall £ < m is the column-sparsity 



of the constraint matrix A — the maximum number of constraints any variable Xi participates in. 



(The O(log^) bound for offline CIPs given by [17, 11] uses a derandomization of the Lovasz Local 



Lemma via pessimistic estimators, and is not applicable in the online setting.) 



Given that the constraints of a CIP arrive online, we run Algorithm 3.1 to maintain vectors x and 



x with properties guaranteed by [Theorem 3.1 . For this section, we set the threshold r to g • 



x — x _^ i — i 1 ••- — — -- « log* 

Before any constraints arrive, pick a uniformly random value pi € [0, 1] for each variable i € [n]— 
this is the only randomness used by the algorithm. We will maintain an integer solution X € Z>o> 
again let X^ denote this solution right after primal constraint j has been satisfied. We start off 
with X^ ) = 0. When the j th constraint arrives and the (fractional) Xi values have been increased 
in response to this constraint, we do the following. 

1. Define the "rounded unaltered" solution: 

if Xi < rpi 

\xi/r] \i t p,i < xi < Tm , Mie[n]. 

lij if Xi > TUi 

2. Maintain monotonicity. Define: 

X? ew = max{xf Zi}, Mi G [n]. 

Observe that this rounding ensures that Xi € {0, 1, . . . , Ui} for all i € [n]. 

3. Perform potential alterations. If we are unlucky and the arriving constraint j is not satisfied by 
X new , we increase X ne1 " to cover this constraint j as follows. Let Hj := {i G [n] \ x± > r • Ui} 
be the frozen variables in the fractional solution; note that Zi = m for all i 6 Hj, so these 
variables cannot be increased. Recall that aj(Hj) := YlreH a rj • u r . Since constraint j is 
not satisfied, aj(Hj) < 1 and the algorithm performs the following alteration for constraint j. 
Consider the residual constraint on variables [n] \ Hj after applying the KC-inequality on Hj , 
i.e. 

minjajj, 1 — aj(Hj)} ■ Wi > 1 — aj(Hj). 

ie[n]\Hj 

Set ciij = min 1 1, l _^ H j for all i € [n] \ Hj. Consider the following covering knapsack 
problem: 

nlin Ei6[n]\ff 3 d ■ W i ( IP I<) 

subject to: J2ie[ n ]\H 3 «y ' w i > 1 

< u>i < Ui , Vi G [n] \ Hj 
wi eZ, Mi e [n] \ Hj 

Note that there is only one covering constraint in this problem. Let W denote an approxi- 
mately optimal integral solution obtained by the natural greedy algorithm. It is clear that W 
satisfies the residual constraint j on variables [n] \ Hj. Define X^ J ^ as follows. 

(, )= f X?™ ioiitH, 

\ max{XV- ew , Wi} for i € [n] \ Hj 
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This completes the description of the algorithm. By construction, it outputs a feasible integral 
solution to the constraints so far, so it remains to bound its expected cost. 

Remark: This algorithm does not require knowledge of the final column-sparsity I in advance. At 
each step, we use the current value of I. Notice that this only affects r and the definition of Z. 
However, for fixed values of Xj and pi (any i G [n]) the value of Zi is non-decreasing with I: so 
vector Z is monotone over time (since I is non-decreasing). We also require a slightly more general 
version of [Theorem 3.1 where we have multiple thresholds t\ < T2 < • • • < r m and replace r by Tj 
in condition (iii). This extension is straightforward and details are omitted. 

Cost of Z. Consider the rounding algorithm immediately after all m constraints have been satisfied. 
If Xi/r G [0,1], then E[Zj] = Pr[pj < x%/t] = Xi/r; if Xi/r > 1, then Zi < \xi/r~\ < 2xi/r with 
probability 1. Hence: 



E 



< 



(2/r) ^2 c i x i 



0(\ogk- \og£) ■ opt JP , 



where we use 1/r = 0(log£), and Theorem 3.1| (ii) to bound Y2i°i x i- 

Cost of X — Z. To account for X — Z, we need to bound the expected cost of any alterations. In 
the sequel, let £j, kj and Tj denote the respective values of £, k and r at the arrival of constraint j. 
When j is clear from context we will drop the subscript. 

Recall that Hj := {i £ [n] \ x£ > Tj ■ u{\ are the frozen variables in the fractional solution after 

handling constraint j, and note Zi = Ui for i 6 Hj. Define Aj := {i € [n] \ xf' < Tj}. Note 
that the randomness only plays a role in the values of {Zi \ i E Aj}, since all variables in [n] \ Aj 

deterministically are set to Zi = min | [a^/Vjl i f»J- Let £j denote the event that an alteration was 
performed for constraint j. The event £j occurs exactly when YlieM a ij ' Xf ew < 1. Since variables 
r € Hj have X™ ew = Z r = u r with probability 1, event £j is the same as aj(Hj) < 1 (which is a 
deterministic condition) and X)ie[n]\Hj a *i ' 

Xf ew < l-aj(Hj). 



Lemma 3.8 The probability of an alteration for constraint j is Pr[£j] 



< 



Proof. Let 6 = 1 — aj(Hj), for £j to occur we have b > 0. Set = min{aij/b, 1} for i G [n] \ Hj. 
Now since Z < X and both are integer- valued, Pr[£,-] 



Pr 



ie[n]\i?3 



< b 



< Pr 



E 

iG[n]\i?3 



■ Zi <b 



= Pr 



Theorem 3.1| (iii) guarantees that X^iG[n]\^ a ^ ' x ^ — ^' ^- mon S * e W \ 



Zi = \x£ /t] deterministically for i G [n] \ (iTj U ^4j), and 
Zi G {0, 1} with K[Zi] = x^/t independently for i G Aj. 



So E 



zZielnWHj a ij ' 



> -. Now Chernoff bound implies for a collection of [0, l]-valued indepen- 



dent random variables, that the probability of their sum being less than r = 1/(8 log lj) times their 
expectation is at most ■ 



Lemma 3.9 Conditioned on £j, the cost of incrementing X 11 ™ to X^' is at most 36 X^es c « ' 
here Sj C [n] are the non-zero columns in constraint j. 



„(i) . 
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Proof. The fractional solution satisfies the KC inequality for set Hj, by Theorem 3.1(iv). In 
particular, setting w[ = xf' for i G Sj\Hj (and zero otherwise) gives a feasible fractional solution to 



the LP relaxation of the covering knapsack subproblem ( IPjA ). It suffices to show that the greedy 



integral solution W to ( \[Pk ) costs 36^ i& y. c% • w[. It is crucial that w[ < r • Ui < Ui/2 for all 



i G [n] \ Hj, as in general the integrality gap due to relaxing (IPr) is unbounded. 

The greedy algorithm orders columns i G [n] \ Hj in non-decreasing Ci/aij order, and increases Wi 
variables integrally (up to their u^s) until Yli®ij • Wi > 1. Since all a^- < 1, it is easy to show that 
this algorithm achieves a 2-approximation for covering knapsack ( | JPy| ) . 

To complete the proof, we show the optimal integral solution to ( [fP^D costs at most 18 X^ieSj Ci ' w i : 
we give a rounding algorithm to obtain an integral solution W' from w' with only a factor 18 increase 
in cost. Set W[ ~ Binom(iij, 2w[/ui) for all i G [n] \ £/j — this definition is valid since w[ < U{/2. 
Clearly W always satisfies the upper bounds Uj and has expected cost 2 c-w'. Moreover, each W[ is a 
binomial r.v. and < 1, so J2i ®ij ' W/ can be viewed as a sum of independent [0, l]-valued random 
variables. The expectation E[^jajj • W[] > 2, so a Chernoff bound gives Pr[^i^ii ' W/ < 1] < 
8/9. Using Markov's inequality, Pr [c • W > 18 c-w'] < 1/9. So with positive probability, W 
satisfies ( \IPk ) an d costs at most 18c • w', showing that Opt( /Px| ) is at most this cost. ■ 



Thus the total expected cost of alterations after m constraints is: 



j=i ieSj i=i i ieSj i=i y^eSj J 



The second inequality uses the monotonicity of the fractional solution x, and the third inequality 
uses that for any i G [n], the value £j is at least q upon arrival of the q th constraint containing 
variable i. 

Combining the expected cost of 0(c • x) for the alterations with the expected cost of 0(log £) ■ (c ■ x) 
for the initial rounding, and [Theorem 3.1| (ii), we get the main result for this section: 

Theorem 3.10 There is an 0(logk ■ log £) -competitive randomized online algorithm for covering 
integer programs with row-sparsity k and column- spar sity i. 

Again, we note that the algorithm does not assume knowledge of the eventual k or £ values; it works 
with the current values after each constraint. Furthermore, the algorithm clearly does not need the 
entire cost function in advance: it suffices to know the cost coefficient q of each variable i at the 
arrival time of the first constraint that contains i. 
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A Limitations of the Guess-and-Double Approach 

We observe here that previously used primal-dual updates (to the best of our knowledge) are insuf- 
ficient to prove a competitive ratio that depends only on k. A large number of online algorithms are 
based on monotone primal-dual updates. Buchbinder and Naor [H, Lemma 3.1] showed that if we 
maintain monotone duals then the primal-dual gap may be as large as J) (log t £ nax ) . In order to get 
around this issue and obtain an 0(log n) competitive ratio for general covering LPs, |, Theorem 4.1] 
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used a guess-and-double framework which uses duals in a partly non-monotone manner. However, 
as we show below, this scheme does not suffice to obtain a primal-dual gap independent of n, even 
when k = 1. 

The guess-and-double scheme proceeds in phases, and within each phase it maintains monotone 
primal as well as duals. But when the phase changes, the scheme resets all dual values to zero and 
starts afresh; this is the only allowed dual reduction. To maintain an approximately feasible dual, 
this scheme is allowed to change phases (and reset duals) only when the primal cost increases by 
(say) a factor of two. Upon arrival of the first constraint (J^ieTi a n x i — 1)> the scheme produces a 
lower bound a\ = minjg^ Ci/an on the optimal value and begins its first phase. In the r th phase it 
is assumed that a r is the optimal value until the primal cost exceeds a r \ at this point the scheme 
sets a r +i = 2 • a r and enters phase r + 1. A competitive ratio of 0{(3) is proven via this scheme 
by showing that after each phase r, the total primal cost is at most /3 times the total dual value 
(added over all phases up to r). 

Lemma A.l Any online algorithm using the guess-and-double framework for covering LPs (even 
with k = 1 ) incurs an unbounded primal to dual ratio. 

Proof. It suffices to show that for every p > 2, there exist instances of the online covering LP with 
k = 1 where any algorithm using the guess-and-double framework incurs a primal to dual ratio of 
at least £l(p). Our instances will have all costs being one, so the primal objective is just YH=i x i- 
Since k = 1, all constraints will be of the form Xi >b for some i £ [n] and b > 0. The first constraint 
is x\ > p pJr2 . So ct\ = p p+2 in the guess-and-double scheme. In each phase r, constraints appear 
for a completely new set of variables follows. Initialize j <— 1. 

Sequence I(r,j) : Constraints of the form x r j > p h with dual variable y r j(h) appear 
for h = 0, 1, . . ., until the first time that algorithm sets dual value y r j(h) < p h ~ x . 

At this point we move on to the next variable x rj '+i, i.e. set j <— j + 1 and repeat the sequence 
I(r,j). Also, the entire phase r ends when the sum of variables in this phase exceeds a r , at which 
point we abort the current sequence I(r,j) and enter phase r + 1. 

Suppose q variables are used in phase r. Let h\, . . . , h q denote the number of constraints produced in 
I(r, 1), . . . , I(r, q) respectively. Note that the dual variables in this phase are [jj^ q ^{yr,j(h) : 1 < h < 
hj}, dual constraints are J2 h y r j(h)/p h < 1 for all j € [q], and dual objective is Ylje[q] J2hVr,j{h)- 

Claim 4 For all j E [q], hj < p + 1. 

Proof. Fix any j G [q]; the dual constraint corresponding to variable x T j reads ^2hUr,j{h)/ p h < 1. 
By definition of the sequence I(r,j), for all 1 < h < hj the dual value y r j(h) > p h ~ 1 . Note that 
duals in a single phase are monotone- so at the end of sequence I(r,j) we have: 

i > EyrAh)/ P h > eV 1 // = ^ 

h h=l ? 

The first inequality is the dual constraint for x r j and the second uses the dual values. ■ 

From this claim it follows that the primal increase of each x r j is at most p p+1 <a\jp< a,./ p. This 
implies that q > 2 variables are used in this phase. Note that the primal increase in phase r is: 

P( r ) = J2p Hj ^ ar ( A - 7 ) 
The next claim shows that the dual increase can only be a small fraction of the primal. 
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Claim 5 The total dual increase in phase r is at most | • P(r). 
Proof. Consider any primal variable x r j, and its dual constraint 

Ylh=i Vr,j{h)/p h < 1. Clearly the maximum dual value achievable from these dual variables 

Now consider j < q— 1; the sequence I(r,j) was ended due to y r ,j{hj) < p h] ~ x . Also by the dual 
constraint, y r j(h) < p h for all 1 < h < hj — 1. Thus: 



1 + 



1 



1-1/p 



< 3 • p h i-\ 



h=l h=l 

where the last inequality uses p > 2. We now obtain that the total dual value in phase r: 



hi 



-1 hj 



q-1 



i=l /i=l j=l h=l j=l 



<(0) /« + T -P(r) < 



3 



Claim 4 



p p+1 + 



P(r) 



This proves the claim. 



Using Claim 5| and ( |A.7| ) it follows that for the input sequence constructed above, the total dual 
value accrued ^ r (j2j h Vr,j{h)j is at most 4/ p times the primal cost Y2 r P( r )- B 

This lemma shows that using just the dual reductions allowed within a guess-and-double framework 
is insufficient to prove a primal-dual ratio independent of n. Instead our online algorithm performs 
more sophisticated dual reduction that is used to prove 0(log /^-competitiveness. 
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